Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Une approche de localisation de symboles non-segmentés dans des documents graphiques

Identifieur interne : 003832 ( Main/Exploration ); précédent : 003831; suivant : 003833

Une approche de localisation de symboles non-segmentés dans des documents graphiques

Auteurs : Thi-Oanh Nguyen [France, Viêt Nam] ; Salvatore Tabbone [France] ; Alain Boucher [Viêt Nam, France]

Source :

RBID : ISTEX:1D6CA49C4B5727E38B044743F7CD0331ED057E1E

Descripteurs français

English descriptors

Abstract

Dans cet article, nous proposons une méthode de localisation de symboles dans des documents graphiques. Les occurrences du symbole dans un document sont détectées grâce à un processus de vote sur des régions candidates. L’approche repose sur un vocabulaire visuel et afin de réduire la complexité d’appariement d’un symbole avec d’autres nous utilisons le modèle vectoriel et une indexation par un fichier inverse. Cette méthode s’appuie sur un descripteur défini à partir du concept de contexte de forme1 adapté aux points d’intérêt. Ce descripteur est invariant à la rotation, à la translation et aux changements d’échelles. Les résultats expérimentaux sur la recherche de symboles isolés et sur la localisation de symboles non- segmentés dans le document sont très prometteurs.
This paper addresses the problem of symbol spotting for graphic documents. We propose an approach where each graphic document is indexed as a text document by using the vector model and an inverted file structure. The method relies on a visual vocabulary built from a shape descriptor adapted to the document level and invariant under classical geometric transforms (rotation, scaling and translation). Regions of interest (ROI) selected with high degree of confidence using a voting strategy are considered as occurrences of a query symbol. The symbol spotting problem consists in locating all instances of a symbol embedded in documents. The representation of these symbols is not straightforward by using a good shape (symbol) descriptor because they are not isolated from their context. Therefore, a common strategy for symbol spotting consists in decomposing documents into components and in applying a shape descriptor on each of them. A vectorization step is needed for most of the approaches and usually, only symbols which satisfy some conditions are retrieved (eg. convexity, connectivity, closure,…). Our objective is to tackle the problem from a point of view where neither symbol hypothesis nor vectorization step is needed. First of all, we proposed a descriptor to represent graphic symbols and its extension to document level. Then, we exploit a technique based on the concept of visual words for indexing graphic documents and for spotting non-segmented symbols into documents. Finally, we introduce a voting process on the detected ROI in order to locate instances of a query symbol. In order to represent graphic symbols, we propose an adaptive solution based on shape contexts. It consists in adapting shape contexts for points of interest. We use the DoG detector to extract the points of interest that are nearby the junctions of object model at different resolutions. The shape context at each point of interest (CFPI) is normalized by the dominant orientation of the point of interest and the mean distance between the point of interest and contour points to make the CFPI invariant under rotation and scaling. Therefore, a symbol s is described by a set of CFPI at the interest points of s. We use also this descriptor to extract the local information in a graphic document by computing the CFPI in the neighbour region of each point of interest. We define the neighbour region for each point of interest according to its resolution. With the goal of reducing the complexity of on-line matchings (for searching and spotting), we use the concept of visual words for our system thanks to the information pre-computed in the off-line step. A clustering technique is executed on the set of descriptors CFPI, calculated from all documents in the database, to create visual words. As the CFPIs are matched with visual words, we can use indexing and retrieval techniques for text documents on graphic documents. While matching the CFPIs with visual words, we propose to associate one CFPI with several visual words according to its similarities with these visual words. The objective is to reduce the ambiguity for the CFPI which are nearby cluster boundaries. For spotting the instances of a symbol in a document, first, we detect in the document the ROI corresponding to the query symbol and then we execute a voting process on these regions. The ROI detection is based on the relation between the considered keypoint and the bounding box of the query. The centre of each region of interest is voted from the similarity between this region and the query, based on a text retrieval technique (the vector model). This technique uses the appearance frequencies of visual words in each region to make the comparison between the query and ROI in the document. Regions having high values are considered as potential regions containing a symbol instance. We have tested the adaptation of CFPI for graphic symbols on the GREC’03 dataset composed of isolated symbols. This base contains symbols of 50 models with different sizes and orientations. The results obtained with the CFPI and the ^-signature are shown. The two descriptors have similar performance while querying with complete symbols. However, the CFPI is more robust when the goal is to retrieve incomplete symbols. For evaluating the spotting system, our tests are executed on a collection of synthetic documents from the SESYD project. In spite of errors, the results are very promising and show the feasibility of our approach.

Url:
DOI: 10.3166/ts.26.419-431


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr">Une approche de localisation de symboles non-segmentés dans des documents graphiques</title>
<author>
<name sortKey="Nguyen, Thi Oanh" sort="Nguyen, Thi Oanh" uniqKey="Nguyen T" first="Thi-Oanh" last="Nguyen">Thi-Oanh Nguyen</name>
</author>
<author>
<name sortKey="Tabbone, Salvatore" sort="Tabbone, Salvatore" uniqKey="Tabbone S" first="Salvatore" last="Tabbone">Salvatore Tabbone</name>
</author>
<author>
<name sortKey="Boucher, Alain" sort="Boucher, Alain" uniqKey="Boucher A" first="Alain" last="Boucher">Alain Boucher</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:1D6CA49C4B5727E38B044743F7CD0331ED057E1E</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.3166/ts.26.419-431</idno>
<idno type="url">https://api.istex.fr/ark:/67375/HT0-R2BNG0GJ-D/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000668</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">000668</idno>
<idno type="wicri:Area/Istex/Curation">000663</idno>
<idno type="wicri:Area/Istex/Checkpoint">000937</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000937</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Nguyen T:une:approche:de</idno>
<idno type="wicri:Area/Main/Merge">003910</idno>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:inria-00431190</idno>
<idno type="url">https://hal.inria.fr/inria-00431190</idno>
<idno type="wicri:Area/Hal/Corpus">006817</idno>
<idno type="wicri:Area/Hal/Curation">006817</idno>
<idno type="wicri:Area/Hal/Checkpoint">002E18</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">002E18</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Nguyen T:une:approche:de</idno>
<idno type="wicri:Area/Main/Merge">003C99</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Pascal:10-0301319</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000211</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000808</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000193</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000193</idno>
<idno type="wicri:doubleKey">0765-0019:2009:Nguyen T:une:approche:de</idno>
<idno type="wicri:Area/Main/Merge">003C27</idno>
<idno type="wicri:Area/Main/Curation">003832</idno>
<idno type="wicri:Area/Main/Exploration">003832</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="fr">Une approche de localisation de symboles non-segmentés dans des documents graphiques</title>
<author>
<name sortKey="Nguyen, Thi Oanh" sort="Nguyen, Thi Oanh" uniqKey="Nguyen T" first="Thi-Oanh" last="Nguyen">Thi-Oanh Nguyen</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA - Université Nancy 2, Campus scientifique - BP 239 - 54506 Vandoeuvre-les-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
</affiliation>
<affiliation wicri:level="1">
<country xml:lang="fr">Viêt Nam</country>
<wicri:regionArea>Institut de la Francophonie pour l'Informatique MSI, UMI 209 UMMISCO, Hanoi</wicri:regionArea>
<wicri:noRegion>Hanoi</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Tabbone, Salvatore" sort="Tabbone, Salvatore" uniqKey="Tabbone S" first="Salvatore" last="Tabbone">Salvatore Tabbone</name>
<affiliation wicri:level="4">
<country xml:lang="fr">France</country>
<wicri:regionArea>LORIA - Université Nancy 2, Campus scientifique - BP 239 - 54506 Vandoeuvre-les-Nancy</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandœuvre-lès-Nancy</settlement>
</placeName>
<orgName type="university">Université Nancy 2</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
<author>
<name sortKey="Boucher, Alain" sort="Boucher, Alain" uniqKey="Boucher A" first="Alain" last="Boucher">Alain Boucher</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Viêt Nam</country>
<wicri:regionArea>Institut de la Francophonie pour l'Informatique MSI, UMI 209 UMMISCO, Hanoi</wicri:regionArea>
<wicri:noRegion>Hanoi</wicri:noRegion>
</affiliation>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>IRD, UMI 209 UMMISCO, IRD France Nord, Bondy, F-93143</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
</placeName>
</affiliation>
<affiliation></affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j" type="main">Traitement du Signal</title>
<title level="j" type="abbrev">Trait. Signal</title>
<idno type="ISSN">0765-0019</idno>
<idno type="eISSN">1958-5608</idno>
<imprint>
<publisher>Lavoisier</publisher>
<date type="published" when="2009-09">2009</date>
<biblScope unit="vol">26</biblScope>
<biblScope unit="issue">5</biblScope>
<biblScope unit="page" from="419">419</biblScope>
<biblScope unit="page" to="431">431</biblScope>
<biblScope unit="page-count">14</biblScope>
<biblScope unit="ref-count">0</biblScope>
<biblScope unit="fig-count">0</biblScope>
<biblScope unit="table-count">0</biblScope>
</imprint>
<idno type="ISSN">0765-0019</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0765-0019</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Content based retrieval</term>
<term>Graphic document</term>
<term>Interest region</term>
<term>Localization</term>
<term>Pattern recognition</term>
<term>Symbol detection</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Document graphique</term>
<term>Détection de symboles</term>
<term>Localisation</term>
<term>Recherche par contenu</term>
<term>Reconnaissance forme</term>
<term>Région intérêt</term>
</keywords>
<keywords scheme="mix" xml:lang="fr">
<term>localisation de symboles</term>
<term>mots visuels</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="fr">Dans cet article, nous proposons une méthode de localisation de symboles dans des documents graphiques. Les occurrences du symbole dans un document sont détectées grâce à un processus de vote sur des régions candidates. L’approche repose sur un vocabulaire visuel et afin de réduire la complexité d’appariement d’un symbole avec d’autres nous utilisons le modèle vectoriel et une indexation par un fichier inverse. Cette méthode s’appuie sur un descripteur défini à partir du concept de contexte de forme1 adapté aux points d’intérêt. Ce descripteur est invariant à la rotation, à la translation et aux changements d’échelles. Les résultats expérimentaux sur la recherche de symboles isolés et sur la localisation de symboles non- segmentés dans le document sont très prometteurs.</div>
<div type="abstract" xml:lang="en">This paper addresses the problem of symbol spotting for graphic documents. We propose an approach where each graphic document is indexed as a text document by using the vector model and an inverted file structure. The method relies on a visual vocabulary built from a shape descriptor adapted to the document level and invariant under classical geometric transforms (rotation, scaling and translation). Regions of interest (ROI) selected with high degree of confidence using a voting strategy are considered as occurrences of a query symbol. The symbol spotting problem consists in locating all instances of a symbol embedded in documents. The representation of these symbols is not straightforward by using a good shape (symbol) descriptor because they are not isolated from their context. Therefore, a common strategy for symbol spotting consists in decomposing documents into components and in applying a shape descriptor on each of them. A vectorization step is needed for most of the approaches and usually, only symbols which satisfy some conditions are retrieved (eg. convexity, connectivity, closure,…). Our objective is to tackle the problem from a point of view where neither symbol hypothesis nor vectorization step is needed. First of all, we proposed a descriptor to represent graphic symbols and its extension to document level. Then, we exploit a technique based on the concept of visual words for indexing graphic documents and for spotting non-segmented symbols into documents. Finally, we introduce a voting process on the detected ROI in order to locate instances of a query symbol. In order to represent graphic symbols, we propose an adaptive solution based on shape contexts. It consists in adapting shape contexts for points of interest. We use the DoG detector to extract the points of interest that are nearby the junctions of object model at different resolutions. The shape context at each point of interest (CFPI) is normalized by the dominant orientation of the point of interest and the mean distance between the point of interest and contour points to make the CFPI invariant under rotation and scaling. Therefore, a symbol s is described by a set of CFPI at the interest points of s. We use also this descriptor to extract the local information in a graphic document by computing the CFPI in the neighbour region of each point of interest. We define the neighbour region for each point of interest according to its resolution. With the goal of reducing the complexity of on-line matchings (for searching and spotting), we use the concept of visual words for our system thanks to the information pre-computed in the off-line step. A clustering technique is executed on the set of descriptors CFPI, calculated from all documents in the database, to create visual words. As the CFPIs are matched with visual words, we can use indexing and retrieval techniques for text documents on graphic documents. While matching the CFPIs with visual words, we propose to associate one CFPI with several visual words according to its similarities with these visual words. The objective is to reduce the ambiguity for the CFPI which are nearby cluster boundaries. For spotting the instances of a symbol in a document, first, we detect in the document the ROI corresponding to the query symbol and then we execute a voting process on these regions. The ROI detection is based on the relation between the considered keypoint and the bounding box of the query. The centre of each region of interest is voted from the similarity between this region and the query, based on a text retrieval technique (the vector model). This technique uses the appearance frequencies of visual words in each region to make the comparison between the query and ROI in the document. Regions having high values are considered as potential regions containing a symbol instance. We have tested the adaptation of CFPI for graphic symbols on the GREC’03 dataset composed of isolated symbols. This base contains symbols of 50 models with different sizes and orientations. The results obtained with the CFPI and the ^-signature are shown. The two descriptors have similar performance while querying with complete symbols. However, the CFPI is more robust when the goal is to retrieve incomplete symbols. For evaluating the spotting system, our tests are executed on a collection of synthetic documents from the SESYD project. In spite of errors, the results are very promising and show the feasibility of our approach.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
<li>Viêt Nam</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Île-de-France</li>
</region>
<settlement>
<li>Vandœuvre-lès-Nancy</li>
</settlement>
<orgName>
<li>Université Nancy 2</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Grand Est">
<name sortKey="Nguyen, Thi Oanh" sort="Nguyen, Thi Oanh" uniqKey="Nguyen T" first="Thi-Oanh" last="Nguyen">Thi-Oanh Nguyen</name>
</region>
<name sortKey="Boucher, Alain" sort="Boucher, Alain" uniqKey="Boucher A" first="Alain" last="Boucher">Alain Boucher</name>
<name sortKey="Nguyen, Thi Oanh" sort="Nguyen, Thi Oanh" uniqKey="Nguyen T" first="Thi-Oanh" last="Nguyen">Thi-Oanh Nguyen</name>
<name sortKey="Tabbone, Salvatore" sort="Tabbone, Salvatore" uniqKey="Tabbone S" first="Salvatore" last="Tabbone">Salvatore Tabbone</name>
<name sortKey="Tabbone, Salvatore" sort="Tabbone, Salvatore" uniqKey="Tabbone S" first="Salvatore" last="Tabbone">Salvatore Tabbone</name>
</country>
<country name="Viêt Nam">
<noRegion>
<name sortKey="Nguyen, Thi Oanh" sort="Nguyen, Thi Oanh" uniqKey="Nguyen T" first="Thi-Oanh" last="Nguyen">Thi-Oanh Nguyen</name>
</noRegion>
<name sortKey="Boucher, Alain" sort="Boucher, Alain" uniqKey="Boucher A" first="Alain" last="Boucher">Alain Boucher</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 003832 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 003832 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:1D6CA49C4B5727E38B044743F7CD0331ED057E1E
   |texte=   Une approche de localisation de symboles non-segmentés dans des documents graphiques
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022